Credit Risk Assessment using Statistical and Machine Learning: Basic Methodology and Risk Modeling Applications

Authors

  • J. Galindo
  • P. Tamayo
Abstract

Risk assessment of financial intermediaries is an area of renewed interest due to the financial crises of the 1980s and 1990s. An accurate estimation of risk, and its use in corporate or global financial risk models, could translate into a more efficient use of resources. One important ingredient in accomplishing this goal is finding accurate predictors of individual risk in the credit portfolios of institutions. In this context we make a comparative analysis of different statistical and machine learning classification methods on a mortgage loan dataset, with the aim of understanding their limitations and potential. We introduce a specific modeling methodology based on the study of error curves. Using state-of-the-art modeling techniques we built more than 9,000 models as part of the study. The results show that CART decision-tree models provide the best estimation of default, with an average error rate of 8.31% for a training sample of 2,000 records. From the error curve analysis for this model we conclude that if more data were available, approximately 22,000 records, a potential 7.32% error rate could be achieved. Neural networks provided the second-best results with an average error rate of 11.00%. The K-Nearest Neighbor algorithm had an average error rate of 14.95%. These results outperformed the standard Probit algorithm, which attained an average error rate of 15.13%. Finally, we discuss the possibilities of using this type of accurate predictive model as an ingredient of institutional and global risk models.
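The error curve idea in the abstract (error rate as a function of training sample size, extrapolated to a larger sample) can be illustrated with a minimal sketch. The functional form `a + b * n**(-c)`, the function names, and the data points below are all assumptions made for illustration; the abstract does not state which curve the authors fitted, and the numbers are synthetic, not taken from the paper.

```python
def predict_error(n, a, b, c):
    """Hypothetical error curve: error floor a plus a term that decays with sample size n."""
    return a + b * n ** (-c)


def fit_error_curve(sizes, errors, exponents=None):
    """Fit errors ~ a + b * n**(-c).

    The exponent c is chosen by grid search; for each candidate c,
    a and b come from ordinary least squares on x = n**(-c).
    """
    if exponents is None:
        exponents = [k / 100 for k in range(5, 201)]  # c from 0.05 to 2.00
    m = len(sizes)
    mean_y = sum(errors) / m
    best = None
    for c in exponents:
        x = [n ** (-c) for n in sizes]
        mean_x = sum(x) / m
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, errors))
        b = sxy / sxx
        a = mean_y - b * mean_x
        sse = sum((a + b * xi - yi) ** 2 for xi, yi in zip(x, errors))
        if best is None or sse < best[0]:
            best = (sse, a, b, c)
    _, a, b, c = best
    return a, b, c


# Synthetic, noise-free points generated from a known curve (NOT data from the paper).
true_a, true_b, true_c = 0.07, 1.5, 0.5
sizes = [250, 500, 1000, 2000]
errors = [predict_error(n, true_a, true_b, true_c) for n in sizes]

a, b, c = fit_error_curve(sizes, errors)
extrapolated = predict_error(22000, a, b, c)
print(f"fitted floor a={a:.4f}, exponent c={c:.2f}")
print(f"error at n=2000: {errors[-1]:.4f}, extrapolated error at n=22000: {extrapolated:.4f}")
```

In this sketch the fitted floor `a` plays the role of the error rate that no amount of additional data would improve on, and the extrapolated value is only as trustworthy as the assumed curve shape.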

Similar resources

Genetic Algorithms for the Optimization of Support Vector Machines in Credit Risk Rating

The assessment of credit risk usually involves the development of rating models that classify credit applicants (firms or individuals) into predefined risk groups. A plethora of methodologies have been proposed to develop such rating models. Among them, support vector machines (SVMs) have rapidly evolved in statistical learning theory as a new modeling technique for developing classification model...

Classification of Customer’s Credit Risk Using Ensemble learning (Case study: Sepah Bank)

Banks' activities are associated with different kinds of risk, such as credit risk. Given the limited financial resources banks have for providing facilities, assessing customers' ability to repay before granting facilities is one of the most important challenges facing the country's banking system. Accordingly, in this research, we tried to provide a model for determinin...

Paper 1323-2017: Real AdaBoost: Boosting for Credit Scorecards and Similarity to WOE Logistic Regression

AdaBoost is a machine learning algorithm that builds a series of small decision trees, adapting each tree to predict difficult cases missed by the previous trees and combining all trees into a single model. We will discuss the AdaBoost methodology and introduce the extension called Real AdaBoost. Real AdaBoost comes from a strong academic pedigree: its authors are pioneers of machine learning a...

Credit Risk Evaluation Using Support Vector Machine with Mixture of Kernel

Recent studies have revealed that emerging machine learning techniques, such as SVM, have advantages over statistical models for credit risk evaluation. In this study, we discuss the application of the support vector machine with mixture of kernel (SVM-MK) to the design of a credit evaluation system that can discriminate good creditors from bad ones. Differing from the standard SVM, the SVM-MK uses the...

Data mining with Support Vector Machine

Machine learning is considered a subfield of artificial intelligence and is concerned with the development of techniques and methods that enable computers to learn. This paper introduces the support vector machine (SVM), a collection of techniques and methodologies developed for machine learning tasks. Support vector machines (SVMs) are a set of related supervised learning methods used for classification and regression. S...

Publication date: 1999